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The use of an approximate reference state wave function |"I>r) in electronic many-body methods can break 
the spin symmetry of Born-Oppenheimer spin-independent Hamiltonians. This can result in significant errors, 
especially when bonds are stretched or broken. A simple spin-projection method is introduced for auxiliary-field 
quantum Monte Carlo (AFQMC) calculations, which yields spin-contamination-free results, even with a spin- 
contaminated j$r)- The method is applied to the difficult Ft molecule, which is unbound within unrestricted 
Hartree-Fock (UHF). With a UHF |$r), spin contamination causes large systematic errors and long equilibra- 
tion times in AFQMC in the intermediate, bond-breaking region. The spin-projection method eliminates these 
problems, and delivers an accurate potential energy curve from equilibrium to the dissociation limit using the 
UHF i$r). Realistic potential energy curves are obtained with a cc-pVQZ basis. The calculated spectroscopic 
constants are in excellent agreement with experiment. 

PACS numbers: 71.15.-m,02.70.Ss,31.25.-v,31.15.Ar 



I. INTRODUCTION 

A standard approach in many-body electronic structure 
methods is to obtain ground and excited state energies from 
an approximate reference state wave function |$r)- For 
example, the coupled-cluster (CC) approximation with sin- 
gle, double, and perturbative triple excitations [CCSD(T)], 
which is widely available in quantum chemistry computer 
codes, typically uses a Hartree-Fock (HF) single-determinant 
l^r)-^ Ground state quantum Monte Carlo (QMC) stochas- 
tic methodsriMi^ which are exact in principle, use projec- 
tion from any |<I>r) that has non-zero overlap with the ground 
state wave function (WF). In practice, however, the Fermionic 
sign problemi^i^i^ must be controlled to achieve accurate 
results. Diffusion QMC (DMC) uses a single- or multi- 
reference WF to impose approximate Fermionic nodal bound- 
ary conditions in real space and also includes a Jastrow fac- 
tor to reduce the stochastic variance. The recently developed 
phaseless auxiliary-field quantum Monte Carlo (AFQMC) 
method^i2ii£iii is an alternative and complementary QMC ap- 
proach, which samples the many-body wave function with 
random walkers in the space of Slater determinants. AFQMC 
provides a different route to controlling the sign problem, us- 
ing the complex overlap of the walkers with which is 
frequently just a single HF determinant. Like the CC method, 
the AFQMC method works in a chosen single-particle basis, 
and it has been successfully applied using Gaussia n'"''^''^''"^ 
and plane wav e^A" basis sets. 

While these correlated methods are generally quite accurate 
near equilibrium geometries, the use of an approximate |$,.) 
can introduce uncontrolled errors as bonds are stretched or 
broken i'^''^''^''^ The main reason for this is that correlation 
effects become increasingly important in the transition region 
where a system begins to dissociate into its fragments, which 
are themselves often open shell systems. The quality of 
typically degrades in this region, since it is derived from a 
simple level of theory. A second reason is the breaking of 
spin or spatial symmetries in these simple reference WFs. 

In previous applications, phaseless AFQMC with an unre- 



stricted Hartree-Fock (UHF) single-determinant 1$^™) was 
found to often give better overall and more uniform accu- 
racy than CCSD(T) in mapping the potential-energy curve 
(PEC) i'°''"'^ In some cases, however, such as the BH and 
N2 molecules, achieving quantitative accuracy of a few mSj, 
for the entire PEC required multi-determinant 1$^)'^ In these 
cases, spin contamination did not appear to be a major source 
of the error seen in the calculations with UHF reference 
states. In this paper, we show that, with a single-determinant 
l^™*"), the AFQMC potential energy curve of the difficult F2 
molecule is qualitatively incorrect in the intermediate disso- 
ciation region. Spin contamination of |$™^) is found to be 
the dominant factor for this error. We describe a simple spin- 
projection method to effectively remove spin-contamination 
effects. 

With 1$™^) and the spin-projection method, the AFQMC 
results of F2 are shown to be accurate (within a few mE^ of 
the near-exact CCSDTQ result) across the entire PEC. One of 
the main appeals of QMC methods is that the computational 
cost typically scales with systems size as a low power Us- 
ing larger basis sets (cc-pVTZ and cc-pVQZ), we then obtain 
realistic PECs and spectroscopic constants and compare them 
with experimental results. 

The remainder of the paper is organized as follows. Sec- 
tion [n] discusses the difficulties in calculating accurate F2 
PECs. In Section |III] a simple method is described that re- 
moves spin-contamination effects in AFQMC calculations. 
Realistic F2 potential energy curves and spectroscopic con- 
stants are presented in Section|IV] Finally, SectionlVjsumma- 
rizes and discusses our principal results. 



II. SPIN CONTAMINATION EFFECTS IN THE 
DISSOCIATION OF THE F2 MOLECULE 

The difficulty in treating the dissociation of the F2 molecule 
is already evident at the mean-field level of theory. The upper 
panel of Fig. [T] shows PECs from HF and density functional 
theory (DFT). UHF does not predict a bound molecule^ii^^ 
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FIG. 1: (Color online) PECs for F2 using the cc-pVDZ basis. The F- 
F internuclear distance, R, is shown in units of the equilibrium value 
= 1.41193AJ^ (Note the energy scales are different between 
the upper and lower figures.) Upper figure: mean-field results, with 
constant shifts added to the DFT energies so that all energies match 
the UHF energy at R/Re ~ 3.0. Lower figure: coupled-cluster 
results, both spin-restricted [RCCSD(T) and RCCSDTQ] and spin- 
unrestricted [UCCSD(T)]. The RCCSDTQ results are from Refll^. 
A Morse fit through the RCCSDTQ points is shown as a guide to the 
eye. Straight line segments connect the RCCSD(T) and UCCSD(T) 
points. 



while the restricted HF (RHF) curve is artificially bound with 
a minimum that is 5% too low compared to experiment. The 
DFT local spin-density approximation^^ (LSDA) and gener- 
alized gradient approximatior^ (GGA/PBE) yield dissocia- 
tion energies which are too large. The hybrid B3LYB2^i2^ 
dissociation energy is closer to experiment, but the shape of 
the B3LYP PEC is not correct in the intermediate region (see 
below in Sec. |IV| ). All our HF and DFT calculations were 
carried out using the quantum chemistry computer program 



GAUSSIAN98 



27 



The difficulty of treating F2 dissociation, even using cor- 
related methods, is illustrated in the bottom panel of Fig. [T] 
For the small (cc-pVDZ) basis se t^^i^^ chosen here, the spin- 



restricted coupled-cluster including up to quadrupole exci- 
tations (RCCSDTQ) is within reach, which is expected to 
give close to exact results in this case. The RCCSD(T) 
and UCCSD(T) calculations, done with GAUSSIAN98?^ or 
NWCHEMjiS use a single-determinant RHF or UHF |<I>,,), re- 
spectively. The RCCSD(T) method breaks down in the disso- 
ciation limit, where the RHF is very poor (as seen in the 
upper panel of the figure). The UCCSD(T) PEC is accurate in 
the dissociation limit, but its shape begins to be distorted near 
the equilibrium bond length and shows significant error in the 
intermediate region. 

-199.05 



-199.06 - 



-199.07 ■ 



-199.08 - 



-199.09 - 



-199.10 - 




□ RCCSDTQ 
AFQMC/UHF 



-199. llL 



1.0 



1.5 



2.0 

R/R 



2.5 



3.0 



FIG. 2: (Color online) AFQMC F2 PEC using a UHF reference state 
WF, compared to RCCSDTQ results from Fig.[T] The cc-pVDZ ba- 
sis set is used. A Morse fit through the RCCSDTQ points is shown as 
a guide to the eye. Straight line segments connect the AFQMCAJHF 
points. The large deviations at R/Re ^ 1.5 are due to spin contam- 
ination of the UHF reference state WF (see text). 

The AFQMC PEC calculated with |<I>™P) (labeled 
AFQMC/UHF) shows good agreement with RCCSDTQ near 
equilibrium and in the dissociation limit, as seen in Fig.|2] In 
the intermediate regime, however, AFQMC/UHF shows devi- 
ations of more than 20m£'h. The poor results in this regime 
are due to the AFQMC phase-free approximation^ when it 
is applied to a walker population that is spin-contaminated. 
The approximation, which depends on the accuracy of \^r), 
is analogous to that in the fixed node DMC method, whose 
performance depends on the accuracy of the |$,.} nodal hy- 
persurface. In view of the inability of 1$™^) to even bind F2 
and its poor quality in the intermediate regime, the inaccurate 
AFQMC results are perhaps not too surprising. 

A brute force approach to improve the AFQMC PEC is to 
use a better |$r), through the use of a multi-determinant ref- 
erence wave function. Indeed, using a generalized valence 
bond (GVB)''-' or complete active space self consistent field 
(CASSCF)ii 1$,) in AFQMC (labeled AFQMC/GVB and 
AFQMC/CASSCF, respectively) eliminates most of the er- 
ror, as shown in Fig. |3] The GVB WF is a perfect-pairing 
GVB(l/2) wave function, where the electron pair responsible 
for the chemical bonding in F2 (those in the 2pz <Jg orbital 
in RHF) now occupy a pair of nonorthogonal, -atomic- 
like orbitals. The GVB WF has the proper dissociation limit. 
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The CASSCF(10,12) is for 10 active electrons and an ac- 
tive space of 12 molecular orbitals. The CASSCF WF is 
truncated, retaining only those determinants whose weights 
(the square of the configuration-interaction coefficients) are 
greater than 4 x 10^''^. (The adequacy of this cutoff was tested 
by performing additional calculations including determinants 
with weights larger than 10^^ in the trial WF. Within statis- 
tical errors, QMC energies similar to those with the higher 
weight cutoff were obtained.) The computational cost in 
AFQMC with a multi-determinant scales linearly with 
the number of determinants, although the real cost is typi- 
cally less since abetter |<i>,.) reduces statistical errors;iiln the 
next section, we show that the improved AFQMC/GVB and 
AFQMC/CASSCF PECs are lai-gely due to the elimination of 
errors from spin contamination. 
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FIG. 3: (Color online) Improvement of the AFQMC F2 PEC us- 
ing multi-determinant reference state WFs from GVB(l/2) and 
CASSCF(10,12). RCCSDTQ from Fig.[T]is shown for comparison. 
Morse fits are shown as a guide to the eye. The inset shows energy 
differences /S.E (in miSh) compared to RCCSDTQ. All calculations 
use the cc-pVDZ basis set. 



III. ELIMINATING SPIN CONTAMINATION IN AFQMC: 
SPIN PROJECTION METHOD 

While the exact eigenstates of a spin-independent non- 
relativistic electronic Hamiltonian are eigenstates of the total 
spin operator S"^ and its z component Sz, approximate wave 
functions may not be eigenstates of S*^ unless special care is 
taken. Such approximate wave functions are called spin con- 
taminated. For simplicity, we restrict the discussion in this 
section to the case where the reference state wave function is 
given by |(E>™^) with Sz ~ 0, approximating the exact sin- 
glet ground state |<&§). In this case |$™^) will generally be 
spin contaminated, i.e., containing triplet j^*') and higher spin 
states: 
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FIG. 4: (Color online) Improvement of the AFQMC F2 PEC using 
spin projection with a single determinant UHF reference state wave 
function. Spin-projected (sp-AFQMCAJHF) results are compared to 
standard AFQMC/UHF without spin projection and to RCCSDTQ 
results from Fig. [2] Morse fits are shown as a guide to the eye, except 
for AFQMCAJHF. All calculations use the cc-pVDZ basis. 



where \^^) is a linear combination of the ground and excited 
singlet states. In the UHF result of Fig. [T] the expectation 
value of the total electronic spin operator in |$™^) is 
0.362, 0.978, and 1.004 at RjR^ = 1.0, 1.5, and 3.0, respec- 
tively, indicating a high level of spin contamination in which 
the triplet component grows as the molecule is stretched. 

Ideally, AFQMC projection of |$™^) would lead to the 
exact spin-contamination-free ground state, since 



^ Co|$g) +Cie-"^(^^-^o)|$i) , (2) 



1$ 



UHF 



) =C,|*^)+Ct|**)-f 



(1) 



where | $ 1 ) is the exact first excited state, r is the time-step pa- 
rameter, and as ri cxD, all components except |$q) become 
vanishingly small. The use of the phase-free approximation^^ 
however, effectively modifies this projection so that a triplet 
component can survive. Thus, the population of AFQMC ran- 
dom walkers will be spin contaminated if it was initialized 
with |4>™^). In F2, the presence of a nearby triplet stated at 
bond lengths R/ Re ^ 1-4 exacerbates this, and this is where 
the AFQMC/UHF PEC shows the largest error 

In the previous section, spin contamination in AFQMC 
was eliminated through the use of a (nearly) spin-pure multi- 
determinant \^r), which effectively filters the population of 
random walkers, retaining only the spin-pure component re- 
gardless of how the population was initialized. The GVB 
is spin-contamination free by design, and the truncated 
CASSCF is nearly free of spin contamination. The elim- 
ination/reduction of spin contamination in the GVB and trun- 
cated CASSCF is a main factor in the improvement of 
the corresponding QMC results. A strong clue to this is seen 
in the case with GVB \^r), where the GVB WF has only two 
determinants and has a variational energy within ^ 1 mi^h that 
of the UHF at R/R^ = 1.5 (see TablelB, and yet QMC/GVB 
greatly improves over QMC/UHF. 

A simpler way to eliminate spin contamination in AFQMC 
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TABLE I: Comparison of computed F2 PEC for various methods, using the cc-pVDZ basis. The RCCSDT and RCCSDTQ results are from 
Ref.|2^ Energies are in Eh. QMC statistical errors are on the last digit and are shown in parentheses. 



R/Re 





1.0 


1.25 


1.5 


2.0 


2.5 


3.0 


RHF 


-198.685670 


-198.612171 


-198.527711 


-198.419839 


-198.374025 


-198.355748 


UHF 


-198.695746 


-198.735754 


-198.747441 


-198.750892 


-198.750597 


-198.750518 


GVB 


-198.761466 


-198.759320 


-198.748618 


-198.743801 


-198.743570 


-198.743650 


CASSCF(10,12) 


-198.886738 


-198.874892 


-198.857896 


-198.850284 


-198.849831 


-198.849732 


RCCSD(T) 


-199.101152 


-199.084940 


-199.070790 


-199.081058 


-199.090213 


-199.093534 


UCCSD(T) 


-199.100100 


-199.075878 


-199.060126 


-199.059302 


-199.058784 


-199.058687 


RCCSDT 


-199.101417 


-199.084493 


-199.065170 


-199.057558 


-199.058023 


-199.057933 


RCCSDTQ 


-199.102961 


-199.087149 


-199.068153 


-199.0594 


-199.05884 


-199.05889 


AFQMCAJHF 


-199.1024(2) 


-199.0932(3) 


-199.0899(4) 


-199.0607(4) 


-199.0586(2) 


-199.0575(1) 


AFQMC/GVB 


-199.1040(3) 


-199.0878(6) 


-199.0673(5) 


-199.0567(5) 


-199.0544(5) 


-199.0540(5) 


AFQMC/CASSCF(10,12) 


-199.1042(9) 


-199.0842(9) 


-199.0663(6) 


-199.0560(8) 


-199.0580(8) 


-199.0562(8) 


sp- AFQMCAJHF 


-199.1020(5) 


-199.0876(7) 


-199.0686(7) 


-199.0574(2) 


-199.0558(3) 


-199.0562(5) 



is to ensure that the population of random walkers consists of 
spin-pure (i.e. RHF-type) Slater determinants, { 10^) }■ Al- 
most all phaseless AFQMC electronic structure calculations 
to date^iiSiliil have used Hubbard-Stratonovich (HS) trans- 
formations which preserve spin symmetry 



-25.195 



(3) 



where x denotes HS auxiliary fields, and the one-body oper- 
ators ?)|(x) and ?)|(x) have identical forms. For example, in 
the plane-wave formalismriH vus (x) is essentially a Fourier 
component of the density operator. Thus, if a random walker 
is initialized to a spin-pure Slater determinant with Sz = 0, 
its spin state cannot be modified by the QMC propagation 
\<j)'g) = e^""'^'''-' Typically, the same trial WF is used 
in phaseless AFQMC to generate the initial population, to 
guide the importance sampling, and to impose the phaseless 
constrainti^SilS This of course does not have to be the case. 
Here we use a spin-pure state to initialize the walkers. Since 
each walker in the population { \(f>s) } remains spin-pure, the 
local energy El[(I)s] projects out the triplet and higher com- 
ponents of |$™^): 



(4) 



The mixed estimator for the ground state energy is determined 
by the local energy, so it too is spin-uncontaminated. Thus, 
higher spin states have no effect on either the AFQMC projec- 
tion, the phase-free approximation, or the ground state energy 
estimation. 

The spin-projected AFQMC (sp-AFQMC) method de- 
scribed above shows a dramatic improvement over the spin- 
contaminated AFQMC/UHF in F2, as seen in Fig. |4] In the 
sp-AFQMC/UHF calculations, the walker population is ini- 
tiaHzed with the RHF solution 10^) = |RHF), but l^™^) is 
used to implement the phase-free constraint^ and to calculate 
the local energy. Table ^tabulates the energies for all methods 
using the cc-pVDZ basis. We see that the sp-AFQMC/UHF 
PEC is in excellent agreement with the RCCSDTQ result, 
with a maximum discrepancy of ~ 3 mE'h. This accuracy is 
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-25.220 



FIG. 5: (Color online) Slow energy equilibration as a function of 
imaginary time nr of spin contaminated AFQMC/UHF, compared 
to spin contamination free AFQMC/RHF and sp-AFQMC/UHF for 
the BH molecule at Re — 1.2344 A. The cc-pVDZ basis is used. 
To reduce clutter, QMC statistical errors are not shown in sp- 
AFQMC/UHF and AFQMC/RHF for nr > 15, but the average size 
of the error bar is indicated for each in the legend. FCI-derived RHF 
and UHF projection curves (see text) are also shown, calculated us- 
ing Eq. lU. The FCI ground-state energy is -21.216 249 Sh. 



in fact slightly better than that of either AFQMC/CASSCF or 
AFQMC/GVB. 

In addition to removing spurious spin contamination effects 
in the calculated AFQMC energy, sp-AFQMC can sometimes 
also reduce the imaginary time [see Eq. (|2])] needed to ob- 
tain energy equilibration. This is illustrated in Fig. |5] for the 
BH molecule. The spin-contaminated AFQMC/UHF has an 
equilibration time nr ^ 100 a. u., about an order of magni- 
tude larger than the spin-contamination-free sp-AFQMC/UHF 
(nr ^ 10 a. u.). For comparison, the curve from AFQMC 
using |<i>|^^'^) is also shown. Starting from the same ini- 
tial state, the the spin-contamination-free AFQMC/RHF has 
a short equilibration time similar to sp-AFQMC/UHF, but the 
converged result has a larger systematic error, because of the 
poorer quality of |$J*^^) as the constraining WF in the phase- 
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less approximation. The different behaviors of the equiUbra- 
tion time can be understood by comparing with FCI-derived 
RHF and UHF projections, which are shown in Fig. |5] We 
calculate the "exact" projection results by expanding the UHF 
and RHF initial WFs in terms of a truncated set of the FCI 
eigenstates (the first 80 eigenstates, obtained with GAMESS^). 
With a UHF initial WF, the long equilibration time is due to 
the presence of low-lying triplet components [see Eq. ([T])], 
which results in smaller effective gap (Ei — Eq) in Eq. (|2]). 
The RHF WF, on the other hand, has no overlap with any 
triplet state, and consequently the effective gap is larger and 
the equilibration time shorter 



TABLE II: The PEC of Ft, computed using spin-projected phaseless 
AFQMC with UHF trial wave function in cc-pVTZ and cc-pVQZ 
basis. Energies are in Eh- QMC statistical errors are on the last digit 
and are shown in parentheses. 



R/Re cc-pVTZ 



cc-pVQZ 



0.80 

0.85 

0.9 

0.925 

0.95 

0.975 

1.0 

1.025 

1.05 

1.075 

1.1 

1.25 

1.4 

1.5 

1.75 

2.0 

3.0 



-199.2321(4) 
-199.2805(5) 
-199.3061(6) 
-199.3134(4) 
-199.3186(5) 
-199.3210(6) 
-199.3211(4) 
-199.3209(4) 
-199.3190(4) 
-199.3176(5) 
-199.3136(5) 
-199.2962(6) 
-199.2797(6) 
-199.2731(6) 
-199.2632(4) 
-199.2603(6) 
-199.2590(6) 



-199.3372(5 
-199.3824(6 
-199.4099(3 
-199.4159(6 
-199.4218(7 
-199.4231(5 
-199.4241(4 
-199.4236(4 
-199.4219(4 
-199.4201(4 
-199.4170(4 
-199.3975(4 
-199.3807(3 
-199.3732(4 
-199.3641(2 
-199.3605(4 
-199.3593(4 



IV. REALISTIC F2 POTENTIAL ENERGY CURVE: 
BASIS-SET CONVERGED SPIN-PROJECTED AFQMC 
RESULTS 

We have shown that the sp-AFQMC PEC is accurate at 
the double zeta cc-pVDZ level, where near-exact CCSDTQ 
coupled-cluster results are available for comparison. As a 
function of bond stretching, sp-AFQMC delivers more uni- 
form accuracy than RCCSD(T) and UCCSD(T) for the diffi- 
cult F2 molecule, with absolute errors of a few mEh or less. 
In this section, we employ large basis sets to obtain a real- 
istic PEC. We also compute F2 spectroscopic constants and 
compare them with experimental results. 

Figure |6] presents the PECs of F2 computed using sp- 
AFQMC/UHF for cc-pVTZ and cc-pVQZ basis sets,^ For 
comparison, PECs from B3LYP RCCSD(T) and UCCSD(T) 
are also shown, representing the best current theoretical re- 
sults. (The B3LYP curves were shifted to agree with sp- 
AFQMC/UHF at R/Re = 3.) The sp-AFQMC/UHF energies 
corresponding to Fig.|6]are also tabulated in Table HIl 



Computed spectroscopic constants are given in Table |III] 
together with those from the many-body RCCSD(T) and 
UCCSD(T), and the independent-electron LSDA, GGA/PBE, 
and B3LYP methods. The spectroscopic constants were ob- 
tained by fitting the calculated PECs in the range 0.8 < 
R/Re < 1.25 to a three-term extended Morse curva^ 



E{r) = Eo 



4 

E 

n^2 



-a(r— Te) 



(5) 



The fitting procedure yields the molecular electronic energy, 
Eq = E{re), equilibrium bond length rg, and the harmonic 
frequency uiq = -^/C^/S/i, where /i is the reduced mass 
of the F2 molecule. The dissociation energy is given by 
De = E{3Re) — E{re). For comparison, De calculated from 
2i?(atom) — E{re), where i?(atom) is a well-converged en- 
ergy for the isolated atom, is also shown for the many-body 
results in the TZ and QZ basis sets. 

The values of sp-AFQMC/UHF re and ujo in Table HU] 
are in excellent agreement with experiment. The dissoci- 
ation energy De = E{3Re) — E{re) is overestimated, how- 
ever This is due to the overestimation of the total energy at 
large R/Re ~ 3, which reflects the deficiency of a simple 
UHF in AFQMC for open-shell systems, as previously 
notedjii To obtain a more accurate De, an AFQMC calcula- 
tion was performed for the isolated F atom with a truncated 
CASSCF(7,13) The 2s through M orbitals were in- 

cluded in the active space of the CASSCF WF. The trunca- 
tion retains determinants with weight greater than 2 x 10~*, 
resulting in a \^r) with 47 determinants. In cc-pVQZ, the 
atomic energy thus calculated is £■ (atom) = —99.6811(5) -Eh, 
while the corresponding RCCSD(T) and UCCSD(T) values 
are -99.681 704 and -99.681 576 -Eh, respectively. The dis- 
sociation energy obtained with De = 2-E(atom) ~ E{re) is in 
excellent agreement with experiment. 

The variations in the results from the TZ to the QZ basis 
sets are still visible but quite small (especially in re and loq). 
It is thus reasonable to expect the residual finite basis set er- 
ror to be small in the QZ basis. A simple extrapolation to the 
infinite basis Umitili increases Dg only by 0.02 eV (0.7 mEh) 
from the cc-pVQZ value. The shape of the sp-AFQMC PEC 
should thus be very close to that at basis set convergence. (In 
contrast, the residual error of the cc-pVQZ absolute molecular 
energies is approximately 110 mEh, estimated using nonrela- 
tivistic energies published in the literaturei^ill) 

Compared to the sp-AFQMC/UHF PEC, the RCCSD(T) 
and UCCSD(T) PECs in Fig. |6] show the same shortcom- 
ings as seen with the cc-pVDZ basis in Fig. [1] While the 
RCCSD(T) PEC near equilibrium is in good agreement with 
sp-AFQMC/UHF, it is very poor in the dissociation limit. For 
this reason, the RCCSD(T) dissociation energy De shown in 
Table |III] is computed only from De = 2E(atom) — E{re)- 
The UCCSD(T) PEC is accurate in the dissociation limit, but 
its shape is significantly distorted near equilibrium. Conse- 
quently, the UCCSD(T) spectroscopic constants are not in as 
good agreement with experiment. The RCCSD(T) re and 
ijjQ are also in excellent agreement with experiment, while 
the UCCSD(T) ljq is 13% too large. This is consistent with 
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FIG. 6: (Color online) PECs for F2 with cc-pVTZ and cc-pVQZ basis sets. A Morse fit passes through the sp-AFQMC/UHF points. Straight 
line segments connect results of the other methods. The B3LYP curves were shifted to agree with sp-AFQMC/UHF at R/ Re = 3. 



TABLE III: Computed F2 spectroscopic constants for three basis sets, together with experimental results. 

Expt° AFQMC RCCSD(T) UCCSD(T) LSDA GGA/PBE B3LYP 



Basis: cc-pVDZ 



re (A) 1.4131(8) 


1.467(4) 


1.4571 


1.4428 


1.3970 


1.4276 


1.4097 


wo (cm-i) 916.64 


725(36) 


785 


853 


1026 


958 


1033 


L»e (eV)* 1.693(5) 


1.293(7) 


1.180'' 


1.142 


3.454 


2.347 


1.638 


Basis: cc-pVTZ 














re (A) 1.4131(8) 


1.411(3) 


1.4131 


1.3987 


1.3863 


1.4138 


1.3957 


Wo (cm-i) 916.64 


928(30) 


926 


1022 


1065 


1001 


1072 


L»e (eV)* 1.693(5) 


1.70(2) 




1.493 


3.486 


2.351 


1.651 


De (eV)^ 1.693(5) 


1.60(1) 


1.523 


1.495 








Basis: cc-pVQZ 














re (A) 1.4131(8) 


1.411(2) 


1.4108 


1.3946 


1.3856 


1.4136 


1.3944 


wo (cm^^) 916.64 


912(11) 


929 


1036 


1062 


997 


1109 


De (eV)* 1.693(5) 


1.77(1) 




1.567 


3.473 


2.321 


1.634 


D, (€Wy 1.693(5) 


1.70(1) 


1.594 


1.569 









"The dissociation energy De is from Ref . [53 (the zero point and spin-orbit 
energies have been removed). The equihbrium internuclear distance is 
from Ref. [33. and the vibrational frequency ujq is from Refill 

*The dissociation energy calculated using E(3Re) — E{re). 

■^The dissociation energy calculated using 2i?(atom) — E{rg). In AFQMC, 
£;(atom) is calculated with a truncated CASSCF(7,13) l^r). 



Fig.[T] where the UCCSD(T) potential well is too narrow com- 
pared with the near-exact RCCSDTQ result. Curiously, the 
UCCSD(T) and B3LYP PECs show very similar deviations 
near equilibrium. 

As expected, LSDA, GGA/PBE, and B3LYP show more 
rapid convergence with basis set size than the correlated meth- 
ods. The B3LYP is good, but since the shape of its 
PEC is incorrect, luq is ^ 20% too large and the equilib- 
rium bond length is too small. (The large discrepancy here 
underscores the difficult nature of F2; in other molecules, 
B3LYP results are typically found to be in good agreement 
with experiment.'*-) Both LSDA and GGA/PBE have poor luq 
and De, while their equilibrium bond lengths are within 
~ 2% of experiment. 



V. SUMMARY AND DISCUSSION 

The accuracy of AFQMC depends on the reference wave 
function |<I>,.), which is used to implement the phase-free 
constraint;^ This is analogous to DMC, which uses a reference 
to impose the fixed-node approximation to control the 
sign problem. In previous applications, AFQMC was found 
to have less reliance on the quality of |$r), and frequently a 
single-determinant was found adequate. In these cases, 
the best results were obtained using the best variational sin- 
gle determinant reference state, namely the HE solution when 
RHE and UHE are the same (e.g., in the H2O molecule at 
equilibrium"^), or the UHE solution |$™^) when the two dif- 
fer. Moreover, the AEQMC method seemed relatively insensi- 
tive, within the spin unrestricted framework, to whether a HE, 
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DFT, or hybrid B3LYP Slater determinant was used as |$r) -^ 
In some cases, however, such as the BH and N2 molecules, 
achieving quantitative accuracy of a few mEl, for the entire 
PEC required multi-determinant |$r)'^ 

It is shown here that, with |$™''), the AFQMC PEC of 
the difficult F2 molecule is qualitatively incorrect in the in- 
termediate dissociation region. Spin-contamination is iden- 
tified as the primary source of the error. We have intro- 
duced a simple scheme, sp- AFQMC, that effectively removes 
spin-contamination effects, regardless of the choice of |$r)- 
It is also illustrated how spin projection can often shorten 
the AFQMC equilibration time. F2 calculations with sp- 
AFQMC/UHF were shown to give a PEC whose accuracy is 
better than a few mEh across the entire curve in the cc-pVDZ 
basis. To our knowledge, these are the most accurate results 
obtained by a theoretical method that easily scales up in sys- 
tem size. The full PEC curves from equilibrium to the dissoci- 
ation limit were then calculated with cc-pVTZ and cc-pVQZ 
basis sets. Spectroscopic constants with the cc-pVQZ basis 
were found to be in excellent agreement with experiment. 

The sp- AFQMC results with a single determinant |$™^) 
are comparable to those obtained with a multi-determinant 
\<^r) trial WF from GVB or CASSCF The spin-projection 



method thus further reduces the reliance of AFQMC on the 
choice of 1$, ), which is one of its most desirable features. 

While the focus has been mainly on AFQMC using a single 
determinant |$™^) reference wave function, the method may 
also prove useful with multi-determinant 1$, ) with significant 
spin contamination. This could arise, for example, in treating 
correlated transition metal systems with truncated CASSCF 
wave functions. 
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